CDS
Accession Number | TCMCG029C32670 |
gbkey | CDS |
Protein Id | XP_023736009.1 |
Location | complement(join(1054712..1055197,1055276..1055403,1055484..1055580,1055653..1056240,1056323..1056727,1056822..1057021,1057114..1057161,1057276..1057309,1057595..1057726,1057822..1057917,1058060..1058126,1058229..1058284,1058395..1058490,1058619..1059342,1059999..1060282)) |
Gene | LOC111883911 |
GeneID | 111883911 |
Organism | Lactuca sativa |
Protein
Length | 1146aa |
Molecule type | protein |
Topology | linear |
Data_file_division | PLN |
dblink | BioProject:PRJNA432228 |
db_source | XM_023880241.1 |
Definition | DNA mismatch repair protein MSH7 [Lactuca sativa] |
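
The Location field in this record uses the GenBank-style feature-location syntax: `complement(join(a..b,...))` lists exon intervals (1-based, inclusive) on the reverse strand. As a consistency check, the following is a minimal sketch that sums the exon lengths and compares the result with the listed protein length. It assumes only this simple form of the syntax (no partial-end markers such as `<` or `>`, no remote accessions); the function names `exon_intervals` and `spliced_length` are hypothetical helpers, not part of any library.

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(1054712..1055197,1055276..1055403,1055484..1055580,"
    "1055653..1056240,1056323..1056727,1056822..1057021,1057114..1057161,"
    "1057276..1057309,1057595..1057726,1057822..1057917,1058060..1058126,"
    "1058229..1058284,1058395..1058490,1058619..1059342,1059999..1060282))"
)

def exon_intervals(location):
    """Return (start, end) pairs, 1-based inclusive, from a simple location string."""
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(location):
    """Total number of nucleotides covered by all intervals."""
    return sum(end - start + 1 for start, end in exon_intervals(location))

exons = exon_intervals(LOCATION)
cds_nt = spliced_length(LOCATION)
print(len(exons), "intervals,", cds_nt, "nt")
print("codons:", cds_nt // 3, "remainder:", cds_nt % 3)
```

For this record the intervals sum to 3441 nt, i.e. 1147 codons, which agrees with the listed length of 1146 aa plus one stop codon.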
EGGNOG-MAPPER Annotation
Sequence
CDS: ATGCAGCGCCAGAAATCCATACTCTCGTTCCTTCAGAAGCCGAAAATTGAAAAACCGGTCGGCGGTGCAGCTGTCACAGGCGACTCTGAGGCGGTTTCTGAGGAGAAGATACATGGGAGAAATCTACCTTCATCTAATCAGCCTATTATTCACTCATCTGCAGTGGATTTCTCGAATGAAATCATTGGAACGGACACTCCGCCAGAGAAGGAGAAACGCCCGTTATTTTCTAGTATCAAGCACAAGTTCGTCAAGCCTAACAGTGTCGAGAAGCCTCGTGATAGGAATTTGTTAGATAGCAGCTGTGATAATATCTTCTCAATATCTAACAACTGTAGTTACTCTAATGGAAGAGAGAAGCAGGGTTCAGTTTCTAATTTTTCAAAAATGAAAAATGTTTCTGATGTAGAAAAAACAGCTTGTCAAGGGGATAAAGGACATCCCTTGATCATAGAAAGTGATAGTGATATAACAGGGCCAGAAACTCCAGGTGCACAACCACTGATTCCACGTTTGAAGCGAGTCCAGGAAGATGGTTGTACCTTTGGCTTTACTACTGGTACCACTGCTGATTTCTCTATCAACAATAGCAAACGAGTGAAATTTTCTCAAGATTTACCTGCTAAAAACAAGAAAGATGAAGTGGCATCTGACATGCCTATGAACAATAGCAAAAGAGCGAACCTTTCTCATGATTTACTGTCTGAAAACAAGAAAGATGAGGTGGCATCTGAAACGCCTATGAACAATAAAAGAAGTGCATATTTTTCTCATGATTTACCTTCTCAAAACAAGAAAGATGATGTGGCATCTGAAACGGCTAGTAAGTTTGACTGGCTTCATCCTTCACGAATCAAAGATGCTAATGGAAGAAGGCCAAACAATCCTCTTTATGATAAGAGAACACTTTACATACCACCTGATGTTTTGAGGACAATGTCAGCATCTCAGAAACAATATTGGGGTGTCAAAAGTCAATACATGGATGTTCTTATTTTCTTTAAAGTGGGAAAATTCTATGAGCTTTATGAACTAGATGCTGAAATTGGACATAAGGAGCTTGATTGGAAAATAACAATGAGTGGTGTTGGGAAATGTCGACAGGTTGGTATCACTGAGCATGCAATTGATGATGCTATTGAAAAGCTATTGGCTCGTGGGTATAAAGTTGGGCGAGTGGAACAGTTGGAAACATCAGAACAAGCAAAATCGAGAGGATCTACTGCTGTAATTCAAAGAAAATTAGTAAATGTGCTTACACCATCAACATTGGTCAATGGTAACATTGGGCCTCAAGCTGTTCATCTTCTTGCTATAAAGGAGGGTATAAGAAATCTTGATGATGGTTCAACTGCATATGGATTTGCCTTTGTTGATTGTGCTGCTTTACAGTTTTGGGTTGGATCAGTCAGTGATGATGCTTCATGTGCAGCTTTAGGGGCTTTGTTGATGCAGGTGTCTCCTTCAGAAGTTCTTTTTGAAAGTCAAGGATTATCCAAAGAGGCTCAGAAGGCCCTTAACAAGTATTCGTTAACTGGTTCCGTTGCCTCACAAATGACTCCATCAGTTCCAGCCACTGATTTTGTTGATTCCTATGAAGTTCGAACTTTTATTCAAATGAAAGGGTATTTTAAAGGCTCCTCAAATCCATGGGATCTTGCACTTAATCAAGTGGCTCATCAGGATGTTGCATTATGTGCTCTTGGTGGACTTTCCAATCATTTATCCAGGTTAAAGTTGGATGATGCACTAAAGAATGGAAGTATTCTACCCTATGAAGTCTACAGAGGATGCCTTAGAATGGATGGACAAACAATGGCCAATCTTGAAATCTTCAGTAATAATGCAGATGGAGGAACATCAGGAACACTTTTCAAATATCTTGATAACTGCTTAACATTTTCTGGAAAACGACTCTTAAGGAAATGGCTATGTCATCCACTGCAAGATGTTGAGGAAATAAACCATAGGCTTAATGTGGTTGAACAACTCATG
GGTCATCCAGATATCATGTCACTTATTTCTCAATATCTCCGGAAGCTTCCGGATTTGGAAAGGTTTTTTGGACAAGTGAAATCCACCTTTCATTCATCTGCTTTGCTTTTATTGCCACTGATTGGAAGCAAAATATTAAAGCAACGAGTGAAAGTATTTGGGTCTCTTGTTAAAGGTTTACGTGTTGGATTGGATTTATTGAAGGTGTTACAAAAGGAAGATCATGTCCTCTCGTTGTTGCTAAAAATATTCAGTCTTCCGATGCTAAGTGGGAACGATGGAATTGATAAATTTCTCACCCAATTTGAAGCAGCTATCGACAGTGATTTCCCAAATTATCAAGCACATGAGATCAAGGATTCAGACGCTGAAATCCTATCAATCTTGATCGAGTTATTCATGGAGAAATCAAATGAATGGTTTCAAGTTATTCTCGCTTTAAACTCCATTGACGTTCTCCGATCTTTTGCTGCCACGTCAAACTTTTCCCGTCTAGCCATGTGTCGACCCGTTATAGTTCCTCGCTCAAACTCGTCGGGTCCCACACTTGATATGAGAGGCTTATGGCATCCGTATGCGCTTGGGGAAACCGGAGGGACACCTGTCCCTAATGACTTGTCTCTTGGAGATAACCAATTTGGTTACAATCCTCGTACTTTGCTGTTGACTGGTCCGAATATGGGCGGGAAGTCAACCCTCCTTCGCGCTACCTGTTTAGCAGTTATCCTTGCTCAGTTAGGTTGCTATGTCCCATGTGAAACATGTGTTATCTCGGCTGCTGATGTTATTTTCACGCGTTTGGGTGCTACTGATCGTATAATGACGGGTGAAAGTACCTTTCTGATTGAATGTACAGAAACGGCCTCGGTTTTGCAAAACGCATCTCAAGATTCTCTTGTGATTCTTGACGAGTTGGGTAGAGGAACAAGCACTTTCGATGGATACGCTATTGCTTATGCTGTGTTTCGTCATCTTGTTGAAAAGGTGAACTGTAGGTTGCTATTTGCCACCCATTACCACCCACTCACGAAGGAATTCGCCTCACACCCTCACGTGACCTTACAACACATGGCATGTGCTTTTGAAAACGTGACATCACTGTCAACCAACAATACCCAGAAGTTGGTGTTTCTCTACCGGCTAACCACCGGTGCATGCCCGGAGAGCTACGGGATGCAGGTGGCGTTAATGGCGGGAATTCCAAAAAAGGTGGTGGAGGCGGCTTGTGAGGCGGGGGAGGTTATGAAAAGAAAGATCGGAGTAAGTTTCCGGTCAAGTGAAAGGCGGTCGGAGTTTTCTACTTTGCATGAGGAGTGGTTGAGGAGTGTTTTGACGGTGTCTAAAGCTGAAGCCCATTGTCTTGGGGATGGTGAAGAAGATGACGTGTTTGATACTCTGTTTTGTTTGTGGCATGAGCTTAAAAGCTCAAACCAAAAGTTGAAGTAA |
Protein: MQRQKSILSFLQKPKIEKPVGGAAVTGDSEAVSEEKIHGRNLPSSNQPIIHSSAVDFSNEIIGTDTPPEKEKRPLFSSIKHKFVKPNSVEKPRDRNLLDSSCDNIFSISNNCSYSNGREKQGSVSNFSKMKNVSDVEKTACQGDKGHPLIIESDSDITGPETPGAQPLIPRLKRVQEDGCTFGFTTGTTADFSINNSKRVKFSQDLPAKNKKDEVASDMPMNNSKRANLSHDLLSENKKDEVASETPMNNKRSAYFSHDLPSQNKKDDVASETASKFDWLHPSRIKDANGRRPNNPLYDKRTLYIPPDVLRTMSASQKQYWGVKSQYMDVLIFFKVGKFYELYELDAEIGHKELDWKITMSGVGKCRQVGITEHAIDDAIEKLLARGYKVGRVEQLETSEQAKSRGSTAVIQRKLVNVLTPSTLVNGNIGPQAVHLLAIKEGIRNLDDGSTAYGFAFVDCAALQFWVGSVSDDASCAALGALLMQVSPSEVLFESQGLSKEAQKALNKYSLTGSVASQMTPSVPATDFVDSYEVRTFIQMKGYFKGSSNPWDLALNQVAHQDVALCALGGLSNHLSRLKLDDALKNGSILPYEVYRGCLRMDGQTMANLEIFSNNADGGTSGTLFKYLDNCLTFSGKRLLRKWLCHPLQDVEEINHRLNVVEQLMGHPDIMSLISQYLRKLPDLERFFGQVKSTFHSSALLLLPLIGSKILKQRVKVFGSLVKGLRVGLDLLKVLQKEDHVLSLLLKIFSLPMLSGNDGIDKFLTQFEAAIDSDFPNYQAHEIKDSDAEILSILIELFMEKSNEWFQVILALNSIDVLRSFAATSNFSRLAMCRPVIVPRSNSSGPTLDMRGLWHPYALGETGGTPVPNDLSLGDNQFGYNPRTLLLTGPNMGGKSTLLRATCLAVILAQLGCYVPCETCVISAADVIFTRLGATDRIMTGESTFLIECTETASVLQNASQDSLVILDELGRGTSTFDGYAIAYAVFRHLVEKVNCRLLFATHYHPLTKEFASHPHVTLQHMACAFENVTSLSTNNTQKLVFLYRLTTGACPESYGMQVALMAGIPKKVVEAACEAGEVMKRKIGVSFRSSERRSEFSTLHEEWLRSVLTVSKAEAHCLGDGEEDDVFDTLFCLWHELKSSNQKLK |
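
The relationship between the CDS and Protein entries above can be illustrated with a short translation sketch. It assumes the standard genetic code (a reasonable assumption for a nuclear plant gene, but not stated in the record) and uses a hypothetical helper `translate`; whitespace is stripped first, so a sequence wrapped over several lines is handled. Only the first 18 and last 12 nucleotides of the CDS are used here to keep the example short.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G
# (first base varies slowest, third base fastest).
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDDDEEEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

def translate(cds):
    """Translate a DNA coding sequence, stopping at the first stop codon.

    Unknown or ambiguous codons are rendered as 'X'.
    """
    cds = "".join(cds.split()).upper()  # drop line breaks and spaces
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

head = "ATGCAGCGCCAGAAATCC"   # first 18 nt of the CDS above
tail = "AAGTTGAAGTAA"         # last 12 nt of the CDS above

print(translate(head))  # MQRQKS
print(translate(tail))  # KLK
```

The two fragments translate to `MQRQKS` and `KLK`, matching the start and end of the Protein entry. Passing the full CDS string to `translate` would allow the same comparison over the whole sequence.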